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THE BROWNIAN CONTINUUM RANDOM TREE AS THE 
UNIQUE SOLUTION TO A FIXED POINT EQUATION 

MARIE ALBENQUE AND CHRISTINA GOLDSCHMIDT 


Abstract. In this note, we provide a new characterization of Aldous’ Brow¬ 
nian continuum random tree as the unique fixed point of a certain natural 
operation on continuum trees (which gives rise to a recursive distributional 
equation). We also show that this fixed point is attractive. 


Introduction 

The Brownian continuum random tree (BCRT), which was introduced and first 
studied by Aldous mw , is the prototypical example of a random M-tree/continuum 
random tree. Its importance derives from the fact that it is the scaling limit of a 
large class of discrete trees including: all critical Galton-Watson trees with finite 
offspring variance El IS], unordered binary trees m, uniform unordered trees El, 
uniform unlabelled unrooted trees [ 21 ] , critical multitype Galton-Watson trees m 
and random trees with a prescribed degree sequence satisfying certain conditions [8] . 
It is also the scaling limit of random dissections |10j and random graphs from sub- 
critical classes pij . 

Many of these convergence results are proved using some sort of functional coding. 
However, particularly in the case of unordered trees, a natural functional coding 
whose distributional properties are easily understood is not always available. In 
such settings, an alternative approach is desirable. 

By a recursive distributional equation for a random variable X taking values in 
some Polish space 5, we mean an equation of the form 

( 1 ) 1 ), 

where Ai, A 2 ,... are i.i.d. copies of X, independent of the family of random vari¬ 
ables and / is a suitable 5-valued function. We can, of course, think of 

this equation in terms of probability distributions: if fi is the distribution of X and 
E(/i) is the distribution of the right-hand side then /i is a fixed point of the operator 
F. 

For families of random variables which satisfy a natural recursive distributional 
equation, the so-called contraction method has been demonstrated to be a powerful 
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tool for proving convergence results. Suppose that (M„) is a sequence of distribu¬ 
tions for which we wish to prove that there exists a limit M. The basic idea is as 
follows. Suppose that Mn can be described recursively in terms of for m < n. 
This equation often allows one to guess a limiting version, in which M is described 
in terms of itself. In other words, M should be the fixed point of some operator 
Suppose that, in addition, 5^ is a contraction in a suitable metric on the space of 
probability measures. Then Banach’s fixed point theorem tells us that there exists 
a fixed point and that —>■ M as n —>■ oo in the sense of that metric. 

This is straightforward in principle, but usually the recursive equation for does 
not have precisely the same form as the limiting operator. Moreover, finding a 
metric in which 5^ is a contraction (but which also yields weak convergence) is often 
highly non-trivial. In practice, this method has been applied very successfully for 
sequences of random variables (see, for example, miiaiii]), but so far there is only 
one result for the more complicated setting of convergence of stochastic processes 

m- 

It is often the case that families of discrete trees have a recursive definition or 
description. Aldous [S] proved that the BCRT is a fixed point for a natural operation 
on continuum trees. With these two facts in mind, it is natural to ask if a contraction 
method can be established for random trees. This seems an ambitious aim, and 
there are several technical issues to be overcome (not least the choice of metric). 
But our original motivation stems from the fact that, if such a principle were 
to be established, then the characterization of possible limits should be the first 
step. In this article, we prove that the BCRT is the unique fixed point of an 
appropriate operator, and that this fixed point is attractive for a certain natural 
class of measures on continuum trees. 

The rest of this note is organised as follows. In Section 1, we provide an overview of 
the various definitions of the BCRT which already exist in the literature. This also 
enables us to introduce various concepts we will need in the sequel. We then set up 
our fixed point equation. In Section 2, we prove that it has a unique solution. In 
Section 3, we show that repeatedly applying the fixed point operator to any suitable 
law on continuum trees gives convergence to the law of the BCRT in the sense of 
the Gromov-Prokhorov topology. Section 4 contains some concluding remarks. 


1. Overview of definitions of the BCRT 
We begin by introducing the notion of an K-tree. 

Definition 1.1. A compact metric space {T,d) is a K-tree if for all x,y GT 

• there exists a unique geodesic from x to y i. e. there exists a unique isometry 
fx,y : [0,d(a;,y)] -t T such that fx,yiO) = x and fx^y{d{x,y)) = y. The 
image of fx^y is called |a;,j/]; 

• the only non-self-intersecting path from x to y is |x,?/] i.e. if q : [0,1] —>■ T 
is continuous and injective and such that q{0) = x and q{l) = y then 

^([ 0 , 1 ]) = lx,yl- 
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An element x G T is called a vertex. A rooted M.-tree is an K-tree (T, d) with a 
distinguished vertex p called the root. The height of a vertex x is d{p,x). The 
degree deg(a;) of a vertex x is the number of connected components of T \ {a;}. By 
a leaf, we mean a vertex of degree 1; write C{T) for the set of leaves of T. The tree 
T is leaf-dense if T is the closure of C{T). We will often want to endow an K-tree 
with a Borel probability measure (/i, say), which allows us to pick random points 
in the tree. 

A measured metric space {X, d, p) is a complete metric space (X, d) equipped with 
a Borel probability measure p (with respect to the metric d) on A. Define a first 
equivalence relation by declaring two such spaces {X,d,p) and {X',d',p') to be 
GHP-equivalent if there exists an isometry / : A —> A' such that the image of p 
under f is p'. Let S denote the space of GHP-equivalence classes of compact mea¬ 
sured metric spaces. Then S is Polish when endowed with the Gromov-Hausdorff- 
Prokhorov topology [1] . Define a second equivalence relation by declaring (A, d, p) 
and {X',d',p') to be GP-equivalent if there exists an isometry g : supp(/i) —> A' 
such that the image of p under g is p', where supp(/r) denotes the topological 
support of p. Let S' denote the space of GP-equivalence classes of compact mea¬ 
sured metric spaces. Then S' is Polish when endowed with the Gromov-Prokhorov 
topology ^n\ . 

1.1. The BCRT as an K-tree encoded by a Brownian excursion. A standard 
way to generate K-trees is via functional encoding. Suppose that h : [0, oo) —>■ [0, oo) 
is a continuous function of compact support such that h{0) = 0. Use it to define a 
pseudo-metric d by 

d{x,y) = h{x)-\-h[y) — 2 inf h[t), x,y>Q. 

x/\y<t<x\/y 

Define an equivalence relation ~ by letting a; ~ y if d{x, y) = 0. Let T = [0, oo)/ 
denote by r : [0, oo) —> T the canonical projection and let d be the metric induced 
on T by d. If a is the supremum of the support of h then note that r(s) = 0 for 
all s > a. This entails that T = r([0, cr]) is compact. The metric space (T, d) can 
then be shown to be an K-tree (see Le Gall [E]). The tree T can be naturally 
rooted at p = ^(0), the equivalence class of 0, and we will sometimes think of it as 
a rooted object and sometimes not. There is a natural measure p on T given by 
the push-forward of the uniform distribution on [0, a] under the projection r. 

We define the BCRT (T, d) to be the K-tree encoded by 

|o t>l, 

where (e(t), 0 < t < 1) is a standard Brownian excursion. We usually endow (T,d) 
with the probability measure m which is the push-forward of the Lebesgue measure 
on [0,1]. 


1.2. The BCRT as a limit of discrete trees. Let T„ be the ordered rooted tree 
representing the genealogy of a Galton-Watson branching process with offspring 
distribution having mean 1 and variance G (0, oo). Think of as a metric 
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space by endowing it with the graph distance dgr (which puts neighbouring vertices 
at distance 1). Let be the uniform measure on the vertices of T„. Then 

as n —>■ oo, in the Gromov-Hausdorff-Prokhorov sense. (The convergence in distri¬ 
bution is originally due to Aldous [3] , although this formulation is closer to that of 
Le Gall [IS].) 


1.3. The BCRT via random finite-dimensional distributions. We may also 
characterize the BCRT as the unique continuum random tree having certain distri¬ 
butional properties. We must first introduce properly what we mean by a continuum 
tree. 

Definition 1.2. A continuum tree is a triple (T, d, /i) where (T, d) is an (unrooted) 
K-tree and fi is a Borel probability measure on T which is non-atomic and satisfies 

. = 1 ; 

• for every v G T of degree k = deg(?;) >2, let Ti,... ,Tk be the connected 
components ofT\ {u}; then pi{Tj) > 0 for all 1 < i < k. 


The set of continuum trees can naturally be endowed with the Gromov-Hausdorff- 
Prokhorov topology, as briefly discussed at the beginning of this section. 

Definition 1.3. A continuum random tree (CRT) is a random variable taking 
values in the set of continuum trees. 


(In ig, Aldous makes slightly different definitions of these quantities which, in 
particular, use rooted trees and, hence, the pointed Gromov-Hausdorff-Prokhorov 
topology). It will be important in the sequel to observe that, if we consider the 
BCRT to be rooted at the equivalence class of 0 in the Brownian excursion con¬ 
struction, then the root has the same distribution as a uniform pick from m on 
T. 

Given a CRT (T, d, ^), let Ri, V 2 , • ■ ■ be i.i.d. samples from the measure pL. For m > 
2, define the reduced tree TZ(rn) to be the subtree of T spanned by Vi, V 2 , ■ ■ •, Idn- 
For every m > 2, TZ{m) is a discrete tree with edge-lengths and labelled leaves, and 
so its distribution is specified by its tree-shape, t, an unrooted tree with m labelled 
leaves, and its edge-lengths. The reduced trees are clearly consistent, in that TZ{m) 
is a subtree of TZ{m -\- 1). 

Theorem 1.4 (Aldous |5]). The distribution of a CRT (T,d,p,) is specified entirely 
by its random finite dimensional distributions, that is, the distribution ofTZ{m) for 
all m > 2. 


The reduced trees of the BCRT are binary almost surely. This entails that TZ{m) 
has 2m— 2 vertices and 2m — 3 edges. Let t be its tree-shape and Xi,X 2 ,.. ■, X 2 m -3 
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be its edge-lengths listed in any (arbitrary, but fixed) order. Then TZ{m) has density 

( 2m—3 \ / ^ /2m—3 

^ exp -- I ^ Xi 

Note that this implies that the tree-shape is, in fact, uniform on the set of binary 
tree-shapes with m labelled leaves, and that the edge-lengths have an exchange¬ 
able distribution. We observe, for future reference, that the distance between two 
uniformly-chosen points of the BCRT has the Rayleigh distribution, with density 
xe~^ and expectation 

(Note that in [3, Aldous restricts his discussion to binary trees, but the theory is 
easily extended; see Haas and Miermont m) 



1.4. The BCRT as a fixed point. The principal contribution of this paper is a 
characterization of the BCRT as the unique fixed point of a certain operation on 
CRT’s. We need a couple of notational ingredients. We first recall the definition of 
the Dirichlet distribution. 


Definition 1.5. Let ai,a 2 , ■ ■ ■ ,cxn > 0. A random variable taking values in the 
space {s = (si, S 2 ,..., s„) : Si > 0,1 < * < = 1} has the Dirichlet 

distribution with parameters (ai, 02 , • ■ •, «„) (written Dir(ai, 02 , • ■ • > <t „)) if it has 
density 


r(Z]i^l Ctj) ai-l 02-1 

nr=ir(a.) ^ ^ 


Ctn. —1 — 1 
Xn-l 


(1 


-X1-X2 - Xn-lT 


with respect to (n — 1)-dimensional Lebesgue measure. 


1 


Let At be the set of probability distributions on (GHP-equivalence classes of) mea¬ 
sured K-trees. We define U : At —?► At as follows: for M G At, 

• Sample independent trees (Ti,di,/ii), {T 2 ,d 2 , fJ. 2 ), (Xbjdsj/ia) having dis¬ 
tribution M; 

• For 1 < J < 3, pick a vertex Xi G Ti according to the measure isf, 

• Sample A = (Ai, A 2 , A 3 ) ^ Dir(l/2,1/2,1/2) independently; 

• Rescale the trees to obtain (Ti, A^^di, Ai^i), (T 2 , d 2 , ^ 2 ^ 2 ), (Ta, Ag^^da, Aa^a); 

• Identify the vertices Xi , X 2 and Ag in the rescaled trees to obtain a single 
larger tree {T°,d) with a marked branch-point; the three measures Ai/ii, 

A 2 fJ ,2 and Aa/ra naturally give rise to a (probability) measure pL on T°; 

• Forget the marked branch-point in order to obtain {T,d,fj,); 1?(M) is the 
distribution of (T, d, p,). 

The operation on trees given by the function S' was first described by Aldous [B]. 

Let M be the law of the BCRT. Theorem 2 of [B] implies, when rephrased in our 
terms, that M is a solution of M = ^{M). Actually, what is shown in is the 
following statement of the “reverse” of this construction: take a BCRT (T, d, m) 
and pick three points independently according to m; the paths between pairs of 
these points intersect in a unique branch-point. 
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Figure 1. The operator 5^ 


Splitting at this branch-point then gives three BCRT’s, which have been randomly 
rescaled by (Ai,A 2 ,A 3 ) and depend on one another only through this rescaling. 
Moreover the former branch-point yields a point chosen independently from the 
mass measure of each of the three subtrees. (An expanded proof of Aldous’ Theorem 
2 may be found in [2].) We will comment on this reversed perspective at the end 
of the paper. 

Let M be a solution to the fixed point equation. Write (fl, T, P) for the probability 
space on which all the forthcoming random objects are defined. In particular, under 
P, let {T,d,fj,) be a continuum random tree sampled from the distribution M. 

The first main result of this article is the following theorem, which is proved in the 
next section. 

Theorem 1.6. Suppose that M is a law on continuum trees which is a fixed point 
of^. Then there exists a > 0 such that if (T, d, fj,) is sampled according to M then 
{T,ad,fi) has the law of the BCRT. 


Before going further, we will briefly discuss the requirement that M he & measure 
on continuum trees. Let (T, d, p) be sampled according to M. The assumption that 
p is is carried by the leaves of T ensures that any hxed point of ^ is binary. Indeed, 
if p gives positive mass to T \ C{T) then it is clear that we can create non-binary 
branch-points. Given that M is a fixed point of (? and that p is carried by C{T), p 
cannot, in fact, be atomic. Indeed, suppose (for a contradiction) that there exists 
X S T{T) such that p{{x}) > 0. Then, with positive probability, U creates a tree 
which carries positive mass at a non-leaf, contradicting p{T \ CiT)) = 0. 

We now discuss the assumption that p has to give a positive measure to any con¬ 
nected subcomponent of the tree. Recall that the BCRT (T, d, m) is encoded by 
(2e(t),0 < t < 1), where e is a standard Brownian excursion. Consider an inde¬ 
pendent Poisson point process (PPP) on [0,1] x [0, oo) with intensity ds ® x~^dx. 
For each point (s, x) of the PPP graft a massless branch of length x to the point 
of T corresponding to s under the canonical projection r (note that t(s) is almost 
surely a leaf). As there are almost surely only finitely many of these branches hav¬ 
ing length longer than any e > 0, this construction yields a compact metric space 
and, therefore, induces a probability distribution on the set of measured K-trees. 
A simple computation shows that this distribution is a solution of the fixed point 
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equation which is clearly not isometric to the BCRT. However, it seems reasonable 
to want to exclude such non-continuum tree-valued solutions. 

Our second main result is as follows. 

Theorem 1.7. Suppose that M is a law on continuum trees such that if (T, d, fi) ^ 
M and, given (T, d, p), Vi, V 2 are sampled independently from p,, then E [d(Vi, V 2 )] 
exists and is equal to •\/ 7 r/ 2 . Let Mn = S'"M. Then 

as n ^ 00 , in the sense of weak convergence of measures with the Gromov- 
Prokhorov topology. 


Note that if E [diVi, V 2 )] = for some a ^ 1 then the same result holds 

on multiplying the metric d by a. We emphasize that there is no need for M to 
be a law on binary continuum trees. For example, M could be the law of a stable 
tree of parameter in ( 1 , 2 ) (which has only infinitary branch-points, almost surely). 
Theorem 1 1.7| is proved in Section 3. 


2. Uniqueness of the fixed point: proof of Theorem |1.6| 

We will prove Theorem |1.6| via random finite-dimensional distributions and Theo¬ 
rem |1.4| We start by thinking about the distance between two uniformly-chosen 
points. Throughout this section, we suppose that M is a measure on continuum 
trees which is a fixed point of tg. We write S(m),m > 2 for the reduced trees of a 
tree (T, d, fi) sampled according to M. 


2.1. Two-point distances. Suppose that {T,d,pi) is sampled from M and let D 
be the distance between two points of (T, d) sampled independently according to 
T- 

Proposition 2.1. There exists a constant a > 0 such that aD has the Rayleigh 
distribution. 


Proof. Suppose that {Ti,di, p,i), (T 2 ,d 2 , ^^ 2 ) and {T 3 , ds, p. 3 ) are sampled indepen¬ 
dently from M. Apply ^ with A = (Ai, A 2 , A 3 ) to obtain a new tree (T, d, pf) ~ M. 
Suppose now that we sample two points independently according to p. Let Pi, P 2 
and P 3 be the number of these points falling in the subtrees of T corresponding 
to Ti, T 2 and T 3 respectively. Then, conditional on A, we have {Pi,P 2 ,P'i) ^ 
Multinomial(2; Ai, A 2 , A 3 ). Let D be the distance between the two points. Then 

(3) D = •\/AiI1i1{p^>o} + ■\/^-C)2 l{P2>o} + •\/A3ll3l{P3>o}) 

where Di, D 2 and P 3 are three independent copies of D, independent of everything 
else on the right-hand side, corresponding to the distances between two uniformly- 
chosen points in each of the three subtrees. Let Wk = -\/Afel{p^>o}j k = 1,2,3. 
Then this is precisely the setting of the smoothing transform studied by Durrett 
and Liggett m- In that paper, it is shown that the nature of the family of solutions 
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to such distributional fixed point equations depends on the analytic properties of a 
certain function depending on the moments of Wi, W 2 , W 3 : for s > 0 , let 


p(s) = log 




}] 


By symmetry, p(s) = log (3E [Vbf l{vyj>o}]) • Now, P (Pi > 0|Ai) = Af + 2Ai(l — 
Ai) = 2Ai — Af. Since Ai ^ Beta(l/2,1), we obtain 


Hence, 


E = 2E Af 


v{s) = log 


+1 


-E 


Af 


+2 


s + 5 


3(s + 7) 


_ (s + 3)(s + 5) ^ 

which is finite for all s > 0 and has its unique zero in s > 0 at s = 1. Moreover, 
v'{\) = —1124, < 0. Theorems 1 and 2 of [TT] then entail that the equation (|^ has 
a unique fixed point, up to a constant scaling factor. Finally, the distance between 
two uniformly chosen points in a BCRT has the Rayleigh distribution and that 
must be a solution to (|3|. Define a by the relation E [D] = oT^ \pKj2. Since the 
Rayleigh distribution has mean •\/ 7 r/ 2 , this concludes the proof. □ 


For future reference, we write Usm for the operator which takes the law of a non¬ 
negative real-valued random variable D' and associates to it the law of 

WiD[ + W 2 P 2 + 

where D[, P 3 are three independent copies of D', independent of everything 

else on the right-hand side, and where bFi, VF 2 , H 3 are exactly as above. 


2.2. A coupling. Having determined the distribution of 5(2) (which, of course, 
has trivial tree-shape), we now want to determine the distribution of the reduced 
trees S(m),m > 3. In order to do so, we proceed by coupling a tree T distributed 
according to M and a realisation T of the BCRT, using the operator We will, 
in fact, find it convenient to set up this coupling more generally. Indeed, fix n > 0 
and let M' be a general law on continuum trees (which is not necessarily a fixed 
point of ^). Now let T ^ we will produce a coupling of T and T. 

Before we can describe this coupling, we need to establish some notation. For 
n > 0, let E = U“q{1,2, 3}* be the set of words on the alphabet {1,2,3} where, 
by convention, {1,2,3}° = {0} is the set containing the empty word. Let E„ = 
U(Lq{1,2,3}* be the set of words with at most n letters. For i e S, write |i| for 
the length of the word i. For 1 < m < |i|, write for the mth letter of i and 
i[m] = ii ■ ■ ■ for the prefix consisting of the first m letters of i. 

Fix n > 0 and start from a family (Ti)iG{i, 2 , 3 }"+i of independent continuum 
random trees with common law M', and a family (Ti)ig{i, 2 , 3 }"+i of 3""'"^ inde¬ 
pendent copies of the BCRT. We will refer to these as the input trees and will 
use them and successive applications of ^ in order to build the trees T = Tq, and 
T = Tjj. At each application of U, we will use the same scaling factors and glue 
together subtrees with the same labels. More precisely, let (A(‘))igs and (t/^*^)igE 










Figure 2. Example of the construction of from some input 
trees (the rescaling is omitted here). For instance, here, _ 

3, L(2.2.3) ^ ^(2,3,3) ^ 1 and L(0'2.3) ^ gp 


be independent families of independent random variables where, 


^ 3 ^) ~ Dir(l/2,1/2,1/2) and = (C/f \ 


172 *^ and are independent uniform random variables on [0,1]. 


r(i) 


for each i G 

(i; 

3 


), where U\ 


S, 

(i) 


The families (T'i)iGE„ and (TDigs^ are constructed recursively as follows. The tree 
Tj (resp. Ti) is constructed by applying g" to Tn, Ti 2 and Ti 3 (resp. Tii, Ti 2 and Tis) 
with scaling factors A^/^ A 2 ^ and Ag^ where we emphasize that the same scaling 
factors are used to construct both families. In each of the trees Tji, T \2 and Tjg 
(resp. Til, T \2 and Tia), we need to pick a uniform point which tells us where to 
glue them together (once rescaled) to form Ti (resp. Tj). But if |i| < n, we will also 
want to keep track of where these uniform points sit in the trees at level n + 1. We 
can split this problem into two parts: first finding the label of the subtree at level 




10 


THE BROWNIAN CRT AS A FIXED POINT 


n in which a particular uniform point lies, and then hnding where precisely within 
that subtree it sits. We will use the random variables to determine the 

label of the subtree, and the exact location of the point is then a uniform pick from 
that subtree. We will use the same labels in T and T but independent picks from 
the respective subtrees chosen. 


Let Afc = for 1 < fc < 3. For i € \ {0} and 1 < fc < 3, recursively 

define Aj^ := AiA^^^ In addition, for j G S, write Aj‘^ := Ay/Aj. For i G S„, let 
j G S„_|i|+i. By construction, Tj has a subtree which is equal to Ty, up to rescaling 


and c?Tij by Aj‘^ and y Aj'^ respectively. When the context is clear, we ignore 
the rescaling and refer to this subtree as Ty. Then the probability that a uniform 
point in Ti belongs to the subtree Ty is equal to Aj'^ For i G and 1 < fc < 3, we 
define a word of length n — |i| such that represents the label 

of the input tree at level n + 1 in which the uniform point sampled in To- sits. It is 
convenient to use a random recursive partition of the interval [0,1] to choose this 
point. The left boundaries of the intervals of this partition are defined by 


= 0 , 








(ifc) 


B 


3 



+ A 


(ifc) 

2 


Recursively, for j G S„_|i|_i\{0}, let 


n(i.fc) _ pj(i,fc) 

■°jl “ "j ’ 


■°j 2 - "j 


A (ifc) 

^jl > 


r,(i,fc) _ n(i.fc) I A (ifc) I A (ifc) 

j3 “ -“j ^j2 ■ 


For 0 < £ < n — |i| — 1, if = j then let 


7- (i,fc,n+l) 
-^€+1 


(1 if < (7^ < 

< 2 if < C/^ < 

[3 ifRjf)<C/f. 


Observe that the definition of depends only on and (C/*'‘^)ies„ ■ 


So, finally, when we sample the uniform point in Tik (resp. in To-) needed to create 
T; (resp. Ti), the value of 7 j(*>fc.n+i) gives the index of the input tree in which the 
point sits. Then, conditionally on this choice, we sample the point uniformly from 
T;fc^(i,fc,„+i) (resp. Tij.j^(i,fc,„+i)). 


Certain statistics of the trees constructed by this coupling depend only on the 
scaling factors and not on the input trees. These statistics are identical for the 
two trees. Moreover, because the construction can be performed consistently for 
different values of n, we can make sense of an infinite version of it as a projective 
limit, which results in a family (^(*7,"+!)^ i G S„, 1 < fc < 3, n > 0) of labels which 
encode the gluing points all the way down. 


2.3. The reduced trees. Now, consider 5(3). Again, in this case, the tree-shape 
is deterministic. We will show that the lengths of the three branches can each be 
expressed as sums of rescaled distances between pairs of uniform points. Fix n and 
consider (T, d,fi)^M to be constructed as in the previous section after recursive 
applications of S' to level n -I- 1. We sample three new independent uniform points 
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from T. We wish to determine whether their branch-point in T has been used as a 
gluing point in the construction of T and, if so, at which step of the construction. 

If, when we decompose T into its three subtrees Ti, T 2 and Ta, the three new points 
all happen to fall into different subtrees, then their branch-point is determined and 
is the point G used to glue Ti, T 2 and together. However, if at least two points 
fall into the same subtree, say Ti, we must then further decompose Ti in order 
to try to determine the location of the branch-point. We continue this process 
recursively until either (a) the three points all fall into different subtrees or (b) we 
reach level n + 1. Now observe that the probability that the points are separated 
depends only on the sizes of the subtrees and not on the underlying structure of the 
trees. In particular, this means that we can use the infinite version of our coupling. 
So let N 3 be the smallest value A: > I such that the branch-point between our three 
uniform points is a gluing point at level k in the infinite coupling. More generally, 
let Nm be the smallest value k > 1 such that the branch-points between our m 
uniform points are all determined as gluing points at levels at most k. 

Proposition 2.2. Form > 3, < 00 almost surely. In particular, V = k) = 

Proof. We proceed by induction on m and start with the case m = 3. There are 
three possibilities for the way in which the three points are distributed amongst the 
subtrees Ti, T 2 and T 3 : 

(1) The three points fall in different subtrees. 

(2) All three points fall in the same subtree. 

(3) Two points fall in the same subtree and the remaining point falls in a 
different subtree. 

In case (I), as observed above, the branch-point is necessarily G. In case (2), we 
have a new independent copy of the original problem of finding the branch-point 
between three points chosen uniformly from a copy of T. In case (3), the branch¬ 
point we seek is the same as that between G and the two uniform points which fell in 
the same subtree. But G is also a uniformly chosen point in that subtree. So again, 
it remains to find the branch-point between three points chosen uniformly from a 
copy of T. Indeed, unless case (1) occurs, we recursively obtain a new (independent) 
copy of the original problem. (See Figurej^for an illustration.) Since case (1) occurs 
with strictly positive probability, it follows that N 3 is a geometric random variable. 
The probability that the three points fall in different subtrees at any step is given 

by 

6 E [A 1 A 2 A 3 ] = —, 

and so we obtain P ~ (if)^ k > 1 . 

For m > 4, we proceed by induction. It will be convenient to define N 2 = 0. 

Suppose that < 00 almost surely for 3 < £ < m — 1 . There are again three 

possibilities for the distribution of m uniform points amongst the subtrees Ti, T 2 

and Tg: 
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Figure 3. Finding the branch-point between three uniform 
points. One point falls in T 2 and two fall in Ta, so we must further 
decompose T 3 . We have an independent copy of the original prob¬ 
lem in T 3 (where one of the points considered is now G). One of 
the points falls in T 33 and the other two in T 31 , so we must further 
decompose T 31 . One of the points now falls in but the two 
others are in T 311 , so we repeat in T^n. Finally, the three points 
fall in different subtrees of T 311 and so we obtain = 4. 

( 1 ) At least two points fall in different subtrees from the rest. 

(2) All m points fall in the same subtree. 

(3) TO — 1 points fall in the same subtree and the remaining point falls in a 
different subtree. 

In cases (2) and (3), we obtain again a new copy of the same problem. In case (1), 
we get two or three independent copies of a problem of strictly smaller size. Again, 
case ( 1 ) occurs with strictly positive probability at each level, and so we have a 
geometric number of trials, Nm say, until it does. Then Nm < 00 almost surely. On 
{Nm < 00}) there are random variables Ai, A 2 , A3 such that 2 < A 2 < Ai < to — 2 , 
0 < A 3 < A 2 and Ai -I- A 2 -|- A 3 = to, which represent the numbers of points falling 
in different subtrees (in decreasing order). Then the remaining number of levels we 
have to explore in order to separate all of the points has the same distribution as 

max { 1 + 1 , -hi, -hi} , 

where the three random variables in the maximum are conditionally independent 
given Ai, A 2 , A 3 . Since A 3 -I-I < A 2 -I-I < Ai-l-1 < to— 1, it follows straightforwardly 
that < 00 almost surely. The result then follows by induction on to. 


□ 


Proposition 2.3. {S{m),m > 2) have the same joint distribution as the reduced 
trees (Tl{m),m > 2) of the BCRT. 





THE BROWNIAN CRT AS A FIXED POINT 


13 


Proof. Fix e > 0 and m > 2. By Proposition |2.2[ there exists n sufficiently large 
that we have P {Nm <11 + 1) > 1 — e. Consider the trees T and T constructed by 
the above coupling to recursion depth n + 1, so that T is distributed according to 
M and T according to the law of the BCRT. Consider m points picked uniformly 
in T and T, where we couple the choice of these points in such a way that they fall 
in subtrees with same label in T and T (this is completely analogous to the way 
we couple the branch-points with the random variables Ui in the previous section). 
On the event {Nm < n 1}, each branch-point of S{m) corresponds to a point 
at which we have glued input trees together. In particular, the shapes of S{m) 
and of the reduced tree S{m) in T are the same by construction. Moreover, the 
lengths of corresponding segments of S{m) and S{m) are all made up of sums of 
scaled distances between pairs of uniform points in trees with the same labels at 
level n + 1 and the same scaling factors. (Note that these scaling factors receive 
appropriate biases from the fact that uniform points have/have not fallen into the 
corresponding trees, but this affects only the scaling factors and not the underlying 
trees since, by construction, the trees and scaling factors are independent.) By 
Proposition 2.1 these distances have the same law in T and T. The result follows 
since e was arbitrary and S{m) = TZ(rn). □ 


In view of Theorem 1.4 Theorem 1.6 follows. 


3. Convergence to the fixed point: proof of Theorem 11.71 

Recall that M is now an arbitrary law on continuum trees. For n > 0, let 
{Tn,dn, Hn) ^ and, conditionally on {Tn,dn, Hn), let Vi,V 2 ,... be i.i.d. 

points of Tn sampled according to /i„. Similarly, let (T, d,m) ^ M and, condi¬ 
tionally on (T,(d,m), let Vi, V 2 ,... be i.i.d. points of T sampled according to m. 
Write fin for the law of dniVi^Vf^) and m for the law of d(Vi,V 2 ) (which is, of 
course, Rayleigh). Let 5„(m) be the reduced tree of spanned by R",... 
and TZ{m) be the reduced tree of T spanned by Vi,... ,Vm. Convergence in the 
Gromov-Prokhorov distance is then equivalent to the convergence 

Sn{m) — ^ TZ{m) as n —)■ cx) 

for each m > 2 (see Greven, Pfaffelhuber and Winter na, or the introduction to 
Bertoin and Miermont m)- 

We will again use the coupling of Subsection |2.2| to prove this. Indeed, for fixed 
m > 3, we must look to recursion depth Nm in order to separate our m uniform 
points. For fixed e > 0, by Proposition |2.2[ we can find k sufficiently large that 
P {Nm < k) > \ — e. We work on the event {Nm < kf. Then, for n > A:, in order 
to obtain coupled trees distributed as M and Mn respectively, we need to “plug 
in” 3^' input trees at level k in the coupling, sampled according to M and Mn-k, 
respectively. Moreover, as in the proof of Proposition |2.3[ the lengths of the edges 
of the reduced trees can then be viewed as sums of scaled distances between uniform 
points in these trees with distributions M and Mn-k- So we need to control the 
distribution fin-k of the distance between two uniform points in a tree distributed 
as Mn-k- Note that f-n-k = the (n — A:)-fold iterate of the smoothing 
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transform applied to the law /to of the distance between two uniformly sampled 
points of To ~ M. Theorem 2(b) of Durrett and Liggett [TT] gives conditions under 
which repeated applications of the smoothing transform yields convergence to a 
fixed point. Recall the function v from the proof of Proposition 2.1 Then the 


conditions of Durrett and Liggett’s theorem are that (a) v has its unique zero in 
s > 0 at s = 1 (b) that v'(X) < 0 and (c) that the law to which we repeatedly 
apply should have the same mean as the fixed point. We already checked (a) 
and (b) in the course of the proof of Proposition 2.1 Moreover, by assumption, 


/p°° a:/io(dx) = E [do(^i°) 1 ^ 2 °)] = \A72 = /q°° a;m(dx), so that (c) also holds. We 
conclude that, for fixed k, we have jln-k = t m as n —>■ oo. 


The edge-lengths in Sn(jn) can then be written as sums of randomly rescaled in¬ 
dependent random variables sampled from jln-k- It is then clear (since we use the 
same random scaling factors in order to construct both) that the edge-lengths of 
Snijn) converge in distribution to those of TZ(m) on the event {Nm < fc} for any 
fixed A: > 1. Since e > 0 was arbitrary, the result follows. □ 


4. Concluding remarks 


4.1. Related work. As mentioned in Subsection 1.4 Aldous [6] shows that, in a 
sense, we can “reverse” the operator 5^. Indeed, we can decompose a BCRT by pick¬ 
ing three uniform points and splitting at the branch-point between them; we obtain 
three independent BCRT’s, Brownian-rescaled by (Ai, A 2 , A 3 ) ^ Dir(l/2,1/2,1/2). 
Each of these subtrees is doubly marked, one mark being the original uniform point 
and the other being the former branch-point. Perhaps a more natural way of phras¬ 
ing the reversal, which yields only a single mark in each subtree, would be to pick 
each of the branch-points in the tree with probability given by 6 times the product 
of the masses of the subtrees into which removal of that branch-point splits the 
tree. 


If we do use three uniforms to pick the branch-point then the two marks in each 
subtree are independent uniform picks from that subtree. This decomposition op¬ 
eration is used recursively by Croydon and Hambly [5] to prove that the BCRT is 
homeomorphic to a certain deterministic fractal with a random self-similar metric, 
along with the naturally-associated measure. In the course of their proof, they show 
(Lemma 10(d) of [3]) that all of the randomness in the BCRT is contained in an 
i.i.d. family of Dir(l/2,1/2,1/2) scaling factors (A;, i G S). 

Although we have referred to this decomposition of the BCRT as the reverse of 
our operator there is, in fact, a rather subtle difference which arises concerning 
marking and labelling. The forward version of Croydon and Hambly’s splitting 
operator acts on doubly uniformly marked trees and can be paraphrased as follows: 
take three independent BCRT’s, Ti,T 2 ,T 3 , each with two independent uniform 
points, labelled 1 and 2. Rescale these trees according to the appropriate Dirichlet 
random vector and glue them together at the points labelled 1. Now relabel the 
point labelled 2 in Ti by 1 , keep the point labelled 2 in T 2 and forget the point 
labelled 2 in T 3 as well as the branch-point just created. Then this is again a doubly 
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uniformly marked BCRT. This seems to us a much less natural “forward” operation 
on continuum trees than the one pursued in this paper, but it has the advantage 
that the recursive decomposition obtained by going backwards does not have any 
of the labelling issues encountered in Section [2?2| Indeed, in this version there is 
no randomness in which subtree attaches to which other subtree. 


4.2. Convergence. The distributional convergence in Theorem |1.7| is in the sense 
of the Gromov-Prokhorov distance which, for example, does not distinguish be¬ 
tween the BCRT and the BCRT decorated by the independent PPP discussed after 
Theorem 1.6 In particular, this convergence is equivalent to the convergence in 
distribution of the random finite dimensional distributions. It would be interest¬ 
ing to find conditions under which the convergence holds instead in the stronger 
Gromov-Hausdorff-Prokhorov sense; in particular, we would need a certain tight¬ 
ness condition to hold (see Corollary 19 of [S]). 
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